Robust Covariance Matrix Estimation for High-Dimensional Compositional Data with Application to Sales Data Analysis
نویسندگان
چکیده
Compositional data arises in a wide variety of research areas when some form standardization and composition is necessary. Estimating covariance matrices fundamental importance for high-dimensional compositional analysis. However, existing methods require the restrictive Gaussian or sub-Gaussian assumption, which may not hold practice. We propose robust adjusted thresholding procedure based on Huber-type M-estimation to estimate sparse structure data. introduce cross-validation choose tuning parameters proposed method. Theoretically, by assuming bounded fourth moment condition, we obtain rates convergence signal recovery property method provide theoretical guarantees under setting. Numerically, demonstrate effectiveness simulation studies also real application sales
منابع مشابه
Methods for regression analysis in high-dimensional data
By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...
متن کاملFast covariance estimation for high-dimensional functional data
We propose two fast covariance smoothing methods and associated software that scale up linearly with the number of observations per function. Most available methods and software cannot smooth covariance matrices of dimension J > 500; a recently introduced sandwich smoother is an exception but is not adapted to smooth covariance matrices of large dimensions, such as J = 10, 000. We introduce two...
متن کاملRobust and sparse correlation matrix estimation for the analysis of high-dimensional genomics data
Motivation Microarray technology can be used to study the expression of thousands of genes across a number of different experimental conditions, usually hundreds. The underlying principle is that genes sharing similar expression patterns, across different samples, can be part of the same co-expression system, or they may share the same biological functions. Groups of genes are usually identifie...
متن کاملEstimation of the Covariance Matrix of Large Dimensional Data
This paper deals with the problem of estimating the covariance matrix of a series of independent multivariate observations, in the case where the dimension of each observation is of the same order as the number of observations. Although such a regime is of interest for many current statistical signal processing and wireless communication issues, traditional methods fail to produce consistent es...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Business & Economic Statistics
سال: 2022
ISSN: ['1537-2707', '0735-0015']
DOI: https://doi.org/10.1080/07350015.2022.2106990